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Abstract — The energy scaling laws of multihop data fusion 
networks for distributed inference are considered. The fusion 
network consists of randomly located sensors distributed i.i.d. 
according to a general spatial distribution in an expanding region. 
Under Markov random field (MRF) hypotheses, among the class 
of data-fusion policies which enable optimal statistical inference 
at the fusion center using all the sensor measurements, the policy 
with the minimum average energy consumption is bounded below 
by the average energy of fusion along the minimum spanning tree, 
and above by a suboptimal policy, referred to as Data Fusion for 
Markov Random Fields (DFMRF). Scaling laws are derived for 
the energy consumption of the optimal and suboptimal fusion 
policies. It is shown that the average asymptotic energy of the 
DFMRF scheme is strictly finite for a class of MliF models with 
Fuclidean stabilizing dependency graphs. 

Index Terms — Distributed inference, graphical models, Eu- 
clidean random graphs, stochastic geometry and data fusion. 



I. Introduction 

WE consider the problem of distributed statistical infer- 
ence in a network of randomly located sensors taking 
measurements and transporting the locally processed data to 
a designated fusion center The fusion center then makes an 
inference about the underlying phenomenon based on the data 
collected from all the sensors. 

For statistical inference using wireless sensor networks, 
energy consumption is an important design parameter The 
transmission power required to reach a receiver distance d 
away with a certain signal-to-noise ratio (SNR) scales in the 
order of d'^, where 2 < < 6 is the path loss [3J. Therefore, 
the cost of moving data from sensor locations to the fusion 
center, either through direct transmissions or through multihop 
forwarding, significantly affects the lifetime of the network. 
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A. Scalable data fusion 

We investigate the cost of data fusion for inference, and 
its scaling behavior with the size of the network and the 
area of deployment. In particular, for a network of n random 
sensors located at points V„ {Vi, • • • , Vn} in M^, a fusion 
policy TTn maps V„ to a set of scheduled transmissions and 
computations. The average cost (e.g., energy) of a policy is 
given by 



n ^ — ^ 



(1) 
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where i£'i(7r„(V„)) is the cost at node i under policy 7r„. The 
above average cost is random, and we are interested in its 
scalability in random networks as n oo. 

Definition 1 (Scalable Policy): A sequence of policies 
7r:=(7r„)„>i is scalable on average if 

lim E(f (7r„(V„))) = £oo{Tr) < oo 

where the expectation E is with respect to the random locations 
V„, and foo(7r) is referred to as the scaling constant. A 
sequence of policies 7r„ is weakly scalable if 

p lim £(7r(V„))) = £00 (tt) < 00, 

where p lim denotes convergence in probability. It is strongly 
scalable if the above average energy converges almost surely 
and is (mean-squared) scalable if the convergence is in 
mean square. 

Hence, a scalable fusion policy implies a finite average 
energy expenditure even as the network size increases. We 
focus mostly on the scalability of the fusion policies, 
which implies weak and average scalability |4|. Further, we 
are interested in lossless data-fusion policies which enable the 
fusion center to perform optimal statistical inference with the 
best inference accuracy as if all the raw sensor data were 
available. 

To motivate this study, first consider two simple fusion 
policies: the direct transmission policy (DT) in which all 
sensors transmit directly to the fusion center (single hop), and 
the shortest-path (SP) policy, where each node forwards its raw 
data to the fusion center using the shortest-path route without 
any data combination at the intermediate nodes. 

We assume, for now, that n sensor nodes are uniformly 
distributed in a square of area n. It is perhaps not surprising 
that neither of the above two policies is scalable as n 00. 
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For the DT pohcjQ, intuitively, the average transmission range 
from the sensors to the fusion center scales as -y/n, thus 
f (DT(V„)) scales as n'^ . On the other hand, we expect the 
SP policy to have better scaling since it chooses the best multi- 
hop path to forward data from each node to the fusion center. 
However, even in this case, there is no finite scaling. Here, the 
average number of hops in the shortest path from a node to the 
fusion center scales in the order of y/n, and thus, £(SP(V„)) 
scales in the order of y/n. Rigorously establishing the scaling 
laws for these two non-scalable policies is not crucial at this 
point since the same scaling laws can be easily established for 
regular networks when sensor nodes are on two-dimensional 
lattice points. See |5|. 

Are there scalable policies for data fusion? Among all 
the fusion policies not performing data combination at the 
intermediate nodes, the shortest-path (SP) policy minimizes 
the total energy. Thus, no scalable policy exists unless nodes 
cooperatively combine their information, a process known 
as data aggregation. Data aggregation, however, must be 
considered in conjunction with the performance requirements 
of specific applications. In this paper, we assume that optimal 
statistical inference is performed at the fusion center as if all 
the raw sensor data were available, and this places a constraint 
on data aggregation. For instance, it rules out sub-sampling of 
the sensor field, considered in ||6l. 

B. Summary of results and contributions 

In this paper, we investigate the energy scaling laws of 
lossless fusion policies which are allowed to perform data 
aggregation at the intermediate nodes, but ensure that the 
fusion center achieves the same inference accuracy as if all the 
raw observations were collected without any data combination. 
We assume that the underlying binary hypotheses for the 
sensor measurements can be modeled as Markov random fields 
(MRF). 

For sensor locations V,i and possibly correlated sensor 
measurements, finding the minimum energy fusion policy 
under the constraint of optimal inference is given by 

£(7r*(V„)) = inf V f.(7r(V„)), (2) 

where 2t is the set of valid lossless data-fusion policies 

2t:={7r : optimal inference is achieved at the fusion center}. 

In general, the above optimization is NP-hard |7|, and hence, 
studying its energy scaling behavior directly is intractable. 
We establish upper and lower bounds on the energy of this 
optimal policy tt* and analyze the scaling behavior of these 
bounds. The lower bound is obtained via a policy conducting 
fusion along the Euclidean minimum spanning tree (MST), 
which is shown to be optimal when the sensor measurements 
are statistically independent under both hypotheses. The upper 
bound on the optimal fusion policy is established through a 
specific suboptimal fusion policy, referred to as Data Fusion 
over Markov Random Fields (DFMRF). DFMRF becomes 

' The direct transmission policy may not even be feasible, depending on the 
maximum transmission power constraints at the sensors. 



optimal when observations are independent under either hy- 
pothesis, where it reduces to fusion along the MST. For certain 
spatial dependencies among sensor measurements of practical 
significance, such as the Euclidean 1 -nearest neighbor graph, 
DFMRF has an approximation ratio 2, i.e., its energy is no 
more than twice that of the optimal fusion policy, independent 
of the size and configuration of the network. 

We then proceed to establish a number of asymptotic prop- 
erties of the DFMRF policy in Section |IV] including its energy 
scalability, its performance bounds, and the approximation 
ratio with respect to the optimal fusion policy when the sensor 
measurements have dependencies described by a fc-nearest 
neighbor graph or a disc graph (continuum percolation). Ap- 
plying techniques developed in fSl-flTl, we provide a precise 
characterization of the scaling bounds as a function of sensor 
density and sensor placement distribution. These asymptotic 
bounds for DFMRF, in turn, imply that the optimal fusion 
policy is also scalable. Hence, we use the DFMRF policy 
as a vehicle to establish scaling laws for optimal fusion. 
Additionally, we use the energy scaling constants to optimize 
the distribution of the sensor placements. For independent 
measurements conditioned on each hypothesis, we show that 
the uniform distribution of the sensor nodes minimizes the 
asymptotic average energy consumption over all i.i.d spatial 
placements when the path-loss exponent of transmission is 
greater than two (i^ > 2). For v e [0, 2), we show that 
the uniform distribution is, in fact, the most expensiv^ node 
configuration in terms of routing costs. We further show that 
the optimality of the uniform node distribution applies for both 
the lower and upper bounds on the average energy consump- 
tion of the optimal fusion policy under Markov random field 
measurements with fc-nearest neighbor dependency graph or 
the disc dependency graph under certain conditions. 

To the best of our knowledge, our results are the first 
to establish the energy scalability of data fusion for certain 
correlation structures of the sensor measurements. The use of 
energy scaling laws for the design of efficient sensor placement 
is new and has direct engineering implications. The fusion 
policy DFMRF first appeared in [12], and is made precise here 
with detailed asymptotic analysis using the weak law of large 
numbers (WLLN) for stabilizing Euclidean graph functionals. 
One should not expect that scalable data fusion is always 
possible, and at the end of Section |IV] we discuss examples 
of correlation structures where scalable lossless data-fusion 
policy does not exist. 

C. Prior and related work 

The seminal work of Gupta and Kumar [13J on the ca- 
pacity of wireless networks has stimulated extensive studies 
covering a broad range of networking problems with different 
performance metrics. See also L14j . Here, we limit ourselves 
to the related works on energy consumption and data fusion 
for statistical inference. 

Results on scaling laws for energy consumption are limited. 
In lUS), energy scaling laws for multihop wireless networks 
(without any data fusion) are derived under different routing 

^The path-loss exponent for wireless transmissions satisfies u > 2. 
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Strategies. The issue of node placement for desirable energy 
scaling has been considered in fTSl, (TT\, where it is argued 
that uniform node placement, routinely considered in the 
literature, has poor energy performance when there is no 
data fusion. It is interesting to note that, for fusion networks, 
uniform sensor distribution is in fact optimal among a general 
class of distributions. See Section HV-BI 

Energy-efficient data fusion has received a great deal of 
attention over the past decade. See a few recent surveys in 
IITSl . |fT9l . It has been recognized that sensor observations 
tend to be correlated, and that correlations should be exploited 
through data fusion. One line of approach is the use of 
distributed compression with the aim of reconstructing all 
the measurements at the fusion center. Examples of such 
approaches can be found in Il20l - ll22l . 

While sending data from all sensors to the fusion center 
is certainly sufficient to ensure optimal inference, it is not 
necessary. More relevant to our work is the idea of data 
aggregation, e.g., ||23l - l|25l . Finding aggregation policies for 
correlated data, however, is nontrivial; it depends on the 
specific applications for which the sensor network is designed. 
Perhaps a more precise notion of aggregation is in-network 
function computation where certain functions are computed by 
passing intermediate values among nodes [26l - ll29l . However, 
these works are mostly concerned with computing symmetric 
functions such as the sum function, which in general, do not 
satisfy the constraint of optimal statistical inference at the 
fusion center. 

In the context of statistical inference using wireless sensor 
networks, the idea of aggregation and in-network processing 
has been explored by several authors. See Il30l - ll36l . Most 
relevant to our work are [3()l-f34l where the Markovian 
correlation structures of sensor measurements are exploited 
explicitly. These results mostly deal with one-dimensional 
node placements, and do not deal with randomly placed nodes 
or energy scaling laws. 

The results presented in this paper extend some of our 
earlier work in the direction of scaling-law analysis in random 
fusion networks. In |7|, lfT2l . ||37 |. for fixed network size 
and node placement, we analyzed the minimum energy fusion 
policy for optimal inference and showed that it reduces to the 
Steiner-tree optimization problem under certain constraints. 
We also proposed a suboptimal fusion policy called the 
DFMRlil. In Ii38i . we analyzed the optimal sensor density for 
uniform node placement which maximizes the inference error 
exponent under an average energy constraint, and in ||39l . ||40l . 
we derived the error exponent for MRF hypotheses. In fSl, we 
analyzed optimal sensor selection (i.e., sub-sampling) policies 
for achieving tradeoff between fusion costs and inference 
performance. 

The energy scaling laws derived in this paper rely heavily 
on several results on the law of large numbers for geometric 
random graphs. We have extensively borrowed the formula- 
tions and techniques of Penrose and Yukich ifTTI . ||4T1 . See 
Appendix |A] for a brief description and ID, [ID, ll42l for 
detailed expositions of these ideas. 

^The DFMRF policy is refeired to as AggMST in (V), (H). 



II. System Model 

In this paper, we consider various graphs. Chief among 
these are (i) dependency graphs specifying the correlation 
structure of sensor measurements, (ii) network graphs denoting 
the (directed) set of feasible links for communication, and (iii) 
fusion policy digraphs denoting the (directed) links used by 
a policy to route and aggregate data according to a given 
sequence. Note that the fusion policy takes the dependency 
graph and the network graph as inputs and outputs the fusion- 
policy digraph. The dependency and network graphs can be 
independently specified and are in general, functions of the 
sensor locations. 

A. Stochastic model of sensor locations 

We assume that n sensor nodes (including the fusion center) 
are placed randomly with sensor i located at Vi E M^. 
By convention, the fusion center is denoted by i = 1, and 
is located at Vi E R^. We denote the set of locations of 
the n sensors by V„:={ Vi, . . . , For our scaling law 
analysis, we consider a sequence of sensor populations placed 
in expanding square regions of area j and centered at the 
origin e M^, where we fix A as the overall sensor density 
and let the number of sensors n — oo. 

To generate sensor locations Vi, first let Qi [^515]^ 
be the unit-area squar^ and Xi r, 1 < ? < n, be a set 
of n independent and identically distributed (i.i.d.) random 
variables distributed on support Qi according to r. Here, r is 
a probability density function (pdf) on Qi which is bounded 
away from zero and infinity. We then generate Vi by scaling 
Xi accordingly: Vi = y^Xi G Q^. A useful special case is 
the uniform distribution (r = 1). Let Va be the homogeneous 
Poisson distribution on with intensity a > 0. 

B. Graphical inference model: dependency graphs 

We consider the statistical inference problem of simple 
binary hypothesis testing, 'Hq vs. "Hi, on a pair of Markov 
random fields. Under regularity conditions |43|, a MRF is de- 
fined by its (undirected) dependency graph S and an associated 
pdf /(• I S). 

Under hypothesis T-Li^ and sensor location set V„ = 
{Vl, ■ ■ ■ 1 Ki} generated according to the stochastic model in 
Section III-AI we assume that the dependency graph := 
(V„,i?fe) models the correlation among the sensor obser- 
vations. Note that the node location set V„ under the two 
hypotheses are identical. Set Ek is the set of edges of the 
dependency graph S/c, and it defines the correlations of the 
sensor observations, as described in the next section. 

We restrict our attention to prox/m/fy-based Euclidean de- 
pendency graphs. In particular, we consider two classes of 
dependency graphfl the (undirected) fc-nearest neighbor graph 
(fc-NNG) and the disc graph, also known as the continuum 

^The results in this paper hold for r defined on any convex unit area. 

^The fc-nearest neighbor graph (fc-NNG) has edges if i is one of 

the top k nearest neighbors of j or viceversa, and ties are arbitrarily broken. 
The disc graph has edges between any two points within a certain specified 
Euclidean distance (radius). 
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percolation graph. We expect that our results extend to other 
locally-defined dependency structures such as the Delaunay, 
Voronoi, the minimum spanning tree, the sphere of influence 
and the Gabriel graphs. An important property of the afore- 
mentioned graphs is a certain stabilization property (discussed 
in Appendix lAb facilitating asymptotic scaling analysis. 

C. Graphical inference model: likelihood functions 

We denote the measurements from all the n sensors placed 
at fixed locations v„ by Yv„. The statistical inference problem 
can now be stated as the following hypothesis test: 

n 

Uo :[Yv„,V„] ~ /(yv„ | SoK), Ho) J] ^( \/^^'), 

n 

Ui :[Yv„,V„] ^/(yv„ | Si(v„),Hi) J] (3) 

1=1 

where /(yv„ | SkjHk) is the pdf of Yv„ given the de- 
pendency graph Sfe(v„) under hypothesis Hk- Note that the 
sensor locations V„ have the same distribution under either 
hypothesis. Therefore, only the conditional distribution of Yv„ 
given the sensor locations V„ = v„ under each hypothesis is 
relevant for inference. 

Under each hypothesis, the dependency graph specifies 
conditional-independence relations between the sensor mea- 
surements Ii43i 

± Yv„\A^(,;g,) I {Y^(,^g,), V„}, under Hfc, (4) 

where J\f{i; 9k) is the set of neighbors of i in 9fc, and i 
denotes conditional independence. In words, the measurement 
at a node is conditionally independent of the rest of the 
network, given the node locations V„ and the measurements 
at its neighbors in the dependency graph. 

The celebrated Hammersley-Clifford theorem ll44l states 
that, under the positivity conditiorQ, the log-likelihood func- 
tion of a MRF with dependency graph 9fc can be expressed 
as 

-log/(yv„ I 9fc(v„),Hfc) = k = 0,l, (5) 

where Cfe is a collection of (maximal) clique^ in Sfc(v„), the 
functions ipk,c, known as clique potentials, are real valued, 
and not zero everywhere on the support of the distribution of 
Yc- 

We assume that the normalization constant (partition func- 
tion) is already incorporated in the potential functions to 
ensure that (|5]l indeed describes a probability measure. In 
general, it is NP-hard to evaluate the normalization constant 
given arbitrary potential functions [451, but can be carried out 
at the fusion center without any need for communication of 
sensor measurements. 

'The positivity condition rules out degeneracy among a subset of nodes: 
Yl = Y2 . . . = Yfc, wliere it is not required for every node to transmit at 
least once for computation of likelihood ratio. 

'a clique is a complete subgraph, and a maximal clique is a clique which 
is not contained in a bigger clique. 



D. Communication model and energy consumption 

The set of feasible communication links form the (directed) 
network graph denoted by 3\fg(v„), for a given realization 
of sensor locations V„ = v„. We assume that it is con- 
nected but not necessarily fully connected, and that it contains 
the Euclidean minimum spanning tree over the node set 
v„ and directed towards the fusion center vi, denoted by 
DMST(v„;ui). Usually in the literature, in order to incorpo- 
rate the maximum power constraints at the nodes, the network 
graph is assumed to be a disc graph with radius above the 
connectivity threshold llT4l . but we do not Umit to this model. 
Transmissions are scheduled so as to not interfere with one 
other. Nodes are capable of adjusting their transmission power 
depending on the location of the receiver. 

A fusion policy 7r(v„) consists of a transmission schedule 
with the transmitter-receiver pairs and the aggregation algo- 
rithm that allows a node to combine its own and received 
values to produce a new communicating value. We model a 
fusion policy tt by a fusion-policy digraph, 3^t^ := (v„, i^^), 
and contains directed links. A directec0 link denotes 
a direct transmission from i to j and is required to be a 
member in the network graph 3\fg(v„) for transmissions to be 
feasible. If one node communicates with another node k times, 
k direct links are present between these two nodes in the edge 
set i^TT of the fusion policy vr. Since we are only interested 
in characterizing the overall energy expenditure, the order of 
transmissions is not important; we only need to consider the 
associated cost with each link in i^^r and calculate the sum 

cost for TT. 

Nodes communicate in the form of packets. Each packet 
contains bits for at most one (quantized) real variable and other 
overhead bits independent of the network size. We assume 
that all real variable^ are quantized to K bits, and K is 
independent of network size and is sufficiently large that 
quantization errors can be ignored. Thus, for node i to transmit 
data to node j which is distance away, we assume that 
node i spends energ\f^ Without loss of generality, we 

assume 7 = 1. Hence, given a fusion policy ^f^r — (v„, i^^r) 
of network size n, the average energy consumption is given 
by 

5(7r(v„)) = -£:(7r(v„)) = i ^ ^jT, 2 < < 6. 
n n — 

(6) 

The model specification is now complete. 

III. Minimum Energy Data Fusion 

In this section, we present data-fusion policies aimed at 
minimizing energy expenditure under the constraint of optimal 
statistical inference at the fusion center, given in (|2|i. The 
scalability of these policies is deferred to Section IIVI 

*We denote a directed link by and an undirected link by 

'in principle, the raw and aggregated data may require different amount of 

energy for communication, and can be incorporated into our framework, 
'"since nodes only communicate a finite number of bits, we use energy 

instead of power as the cost measure. 
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A. Optimal data fusion: a reformulation 

We consider correlated sensor measurements under the 
Markov random field model. The inference problem, defined in 
(|3]l, involves two different graphical models, each with its own 
dependency graph and associated likelihood function. They do 
share the same node location set V„ which allows us to merge 
the two graphical models into one. 

For a given realization of sensor locations V„ = v„, define 
the joint dependency graph S:=(v„, £'), where E:—Eq [J Ei, 
as the union of the two (random) dependency graphs 9o 
9i. The minimal sufficient statistico is given by the log- 
likelihood ratio (LLR) 1|47J. With the substitution of Q, it 
is given by 

/(yv„ I SoK),Ho) 



/(yv„ I Si(v„),Hi) 



Lsiy^J := log 



beSo 



(7) 



where C is the set of maximal cliques in 9 and the effective 
potential functions (j)c are given by 



0c(yc):= ^ i/JiAya) 

a^Ci ,aCc 



MYb), Vcee. (8) 



Hereafter, we work with (S,ig(yv„)) and refer to the joint 
dependency graph 9 as just the dependency graph. 

Note that the log-likelihood ratio is minimally sufficient ll47l 
(i.e., maximum dimensionality reduction) implying maximum 
possible savings in routing energy through aggregation under 
the constraint of optimal statistical inference. Given a fixed 
node-location set v„, we can now reformulate the optimal 
data-fusion problem in dU as the following optimization 



£(7r*(v„))= inf V £:,(7r(v„)), 



(9) 



where ^Jg is the set of valid data-fusion policies 

^c^:^{tt : Lg(yv„) computable at the fusion center}. 

Note that the optimization in (|9]l is a function of the depen- 
dency graph 9(v„), and that the optimal solution is attained 
by some policy. In general, the above optimization is NP-hard 
El. 

B. Minimum energy data fusion: a lower bound 

The following theorem gives a lower bound on the minimum 
energy in (|9]l, given the joint dependency graph S and the path- 
loss exponent ly. Let MST(v„) be the Euclidean minimum 
spanning tree over a reaUzation of sensor locations V„ ~ v„. 

Theorem 1 (Lower bound on minimum energy expenditure): 
The following results hold: 

"a sufficient statistic is a well-beliaved function of tlie data, wliicli is as 
informative as the raw data for inference. It is minimal if it is a function of 
every other sufficient statistic 1461 . 



1) the energy cost for the optimal fusion policy tt* in ^ 
satisfies 



e(7r*(v„)) > £(MST(v„)): 



E 

eeMST(v„ 



(10) 



2) the lower bound (fTOl l is achieved (i.e., equality holds) 
when the observations are independent under both hy- 
potheses. In this case, the optimal fusion policy vr* 
aggregates data along DMST(v„; vi), the directed min- 
imum spanning tree, with all the edges directed toward 
the fusion center vi. Hence, the optimal fusion digraph 
is the DMST(v„;ui). 
Proof: We first prove part 2), for which we consider the case 
when observations are independent, and the log-likelihood 
ratio is given by 

hAy^) 



where ^ is the marginal pdf at node i under T-Lk- Consider 
MST(v„), whose links minimize ^ |e|'^. It is easy to 

eGTi"ee{vTi) 

check that at the fusion center, the log-likelihood ratio can 
be computed using the following aggregation policy along the 
DMST(v„ ; ui ) as illustrated in Fig[T] each node i computes 
the aggregated variable qt (yv„ ) from its predecessor and sends 
it to its immediate successor. The variable qi is given by the 
summation 



*(yv„):= X! 9i(yv„) + 



(11) 



where Np{i) is the set of immediate predecessors of i in 
DMST(v„;vi). 

To show part 1), we note that any data-fusion policy must 
have each node transmit at least once and that the transmission 
must ultimately reach the fusion center This implies that the 
fusion digraph must be connected with the fusion center and 
the DMST with edge-weight \eY' minimizes the total energy 
under the above constraints. Hence, we have ( fTOl l. □ 

Note that the above lower bound in ( fTOl i is achievable when 
the measurements are independent under both hypotheses. It 
is interesting to note that data correlations, in general, in- 
crease the energy consumption under the constraint of optimal 
inference performance since the log-likelihood ratio in (|7]i 
cannot be decomposed fully in terms of the individual node 
measurements. 

C. Minimum energy data fusion: an upper bound 

We now devise a suboptimal data-fusion policy which gives 
an upper bound on the optimal energy in ^ for any given 
dependency graph 9 of the inference model. The suboptimal 
policy is referred to as Data Fusion on Markov Random 
Fields (DFMRF). It is a natural generalization of the MST 
aggregation policy, described in Theorem [1] which is valid 
only for independent measurements. 

We shall use Fig. |2] to illustrate the idea behind DFMRF. 
Recalhng that the log-likelihood ratio for hypothesis testing 
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«2 = -£-2(1/2) +94+95 

Fusion center 



Fig. 1. The optimal fusion graph DMST for independent observations. 

of Markov random fields is given by O, DFMRF consists of 
two phases: 

1) In the data forwarding phase, for each cHque c in the 
set of maximal cliques C of the dependency graph S, 
a processor, denoted by Proc(c), is chosen arbitrarily 
amongst the members of the clique c. Each node in 
clique c (other than the processor itself) forwards its 
raw data to Proc(c) via the shortest path using links in 
the network graph J^g. The processor Proc(c) computes 
the clique-potential function (f)c (yc ) using the forwarded 
data. 

2) In the data-aggregation phase, processors compute the 
sum of the clique potentials along DMST(v„; wi), the 
directed MST towards the fusion center, thereby deliv- 
ering the log-likelihood ratio in (|7]l to the fusion center 

Hence, the fusion-policy digraph for DFMRF is the union 
of the subgraphs in the above two stages, viz., forwarding 
subgraph (FG(v„)) and aggregation subgraph (AG(v„)). The 
total energy consumption of DFMRF is the sum of energies 
of the two subgraphs, given by 

£(DFMRF(v„)) = 

cee(v„) icc 

+ e(MST(v„)), (12) 

where U'^{i,j]'Hg) denotes the energy consumption for the 
shortest path from i to j using the links in the network graph 
^g(vn) (set of feasible links for direct transmission). Recall 
that the network graph J^g is different from the dependency 
graph S since the former deals with communication while the 
latter deals with data correlation. 

For independent measurements under either hypothesis, 
the maximal clique set 6 is trivially the set of vertices v„ 
itself and hence, DFMRF reduces to aggregation along the 
DMST(v„; ui), which is the optimal policy tt* for independent 
observations. However, in general, DFMRF is not optimal. 
When the dependency graph S in O is the Euclidean 1 -nearest 
neighbor graph, we now show that the DFMRF has a constant 
approximation ratio with respect to the optimal data-fusion 
policy TT* in (|9|l for any arbitrary node placement. 



Theorem 2 (Approximation under 1-NNG dependency l\12]l ): 
DFMRF is a 2-approximation fusion policy when the 
dependency graph S is the Euclidean 1 -nearest neighbor 
graph for any fixed node set v„ e 



£(DFMRF(v„)) 
£(7r*K)) 



< 2. 



(13) 



Proof: Since 1-NNG is acyclic, the maximum clique size is 
2. Hence, for DFMRF, the forwarding subgraph (FG) is the 
1-NNG with arbitrary directions on the edges. We have 



£(FG(v„)) = £(l-NNG(v„)) < £(MST(v„)). 



Thus, 



£(DFMRF(v„)) = £(FG(v„)) + £(AG(v„)), (14) 
<2 £(MST(v„)) < 2£(^*(v„)),(15) 

where the last inequality comes from Theorem [1] □ 
Note that the above result does not extend to general k- 
NNG dependency graphs (k > 1) for finite network size n. 
However, as the network size goes to infinity (n — > 00), we 
show in Section IIV-BI that a constant-factor approximation 
ratio is achieved by the DFMRF policy. 

IV. Energy Scaling Laws 

We now establish the scaling laws for optimal and subop- 
timal fusion policies. From the expression of average energy 
cost in (|6]l, we see that the scaling laws rely on the law of large 
numbers (LLN) for stabilizing graph functionals. An overview 
of the LLN is provided in Appendix |A] 

We recall some notations and definitions used in this 
section. Xi ' r, where r is supported on Qi, the unit 
square centered at the origin 0. The node location-set is 
'^n-~\/j{Xi)2^i and the limit is obtained by letting n — > 00 
with fixed A > 0. 



A. Energy scaling for optimal fusion: independent case 

We first provide the scaling result for the case when the 
measurements are independent under either hypothesis. From 
Theorem [T] the optimal fusion policy minimizing the total 
energy consumption in (|9]l is given by aggregation along the 
directed minimum spanning tree. Hence, the energy scaling is 
obtained by the asymptotic analysis of the MST. 

For the random node-location set V„, the average energy 
consumption of the optimal fusion policy for independent 
measurements is 



£(^*(V„))-£(MST(V„)) = - V \e\r (16) 

n ^-^ 

eeMST(V„) 

Let C(i^;MST) be the constant arising in the asymptotic 
analysis of the MST edge lengths, given by 



C(!/;MST):=E 



E 



ee£;(0;MST(PiU{0})) 



(17) 
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Fusion center 
Processor 
Dependency graph 
Forwarding subgraph (FG) 
Aggregation graph (AG) 



(a) Maximal cliques of depen- (b) Forwarding subgraph com- (c) Aggregation subgraph adds (d) Legend 

dency graph putes clique potentials computed potentials 

Fig. 2. Schematic of dependency graph of Markov random field and stages of data fusion. 



where Va is the homogeneous Poisson process of intensity 
a > 0, and £'(0;MST('Pi U {0})) denotes the set of edges 
incident to the origin in MST(7'i U {0}). Hence, the above 
constant is half the expectation of the power-weighted edges 
incident to the origin in the minimum spanning tree over a 
homogeneous unit intensity Poisson process, and is discussed 
in Appendix |A] in (|42] |. Ahhough ^(j^; MST) is not available 
in closed form, we evaluate it through simulations in Section 

m 

We now provide the scaling result for the optimal fusion 
policy when the measurements are independent based on the 
LLN for the MST obtained in [11 Thm 2.3(ii)]. 

Theorem 3 (Scaling for independent data UllV ): When the 
sensor measurements are independent under each hypothesis, 
the limit of the average energy consumption of the optimal 
fusion policy in (fT6l l is given by 

lim £(7r*(V„))= \~^C{^;MST) [ T{xf~'idx. (18) 

Hence, asymptotically the average energy consumption of 
optimal fusion is a constant (independent of n) in the mean- 
square sense for independent measurements. In contrast, for- 
warding all the raw data to the fusion center according to the 
shortest-path (SP) policy has an unbounded average energy 
growing in the order of ^/n. Hence, significant energy savings 
are achieved through data fusion. 

The scaling constant for average energy in ( fTSl l brings out 
the influence of several factors on energy consumption. It is 
inversely proportional to the node density A. This is intuitive 
since placing the nodes with a higher density (i.e., in a smaller 
area) decreases the average inter-node distances and hence, 
also the energy consumption. 

The node-placement pdf r influences the asymptotic energy 
consumption through the term 




When the placement is uniform (r = 1), the above term 
evaluates to unity. Hence, the scaling constant in ( fTSl l for 
uniform placement simplifies to 

A-^C('^;MST). 

The next theorem shows that the energy under uniform node 
placement (r = 1) optimizes the scaling limit in ( fTSl l when 
the path-loss exponent v > 2. Also, see Fig|3] 



2 5 




°0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 

Path-loss exponent ^ 

Fig. 3. Ratio of energy consumption under node placement distribution r 
and uniform distribution as a function of path-loss exponent u. See )19t and 

a. 

Theorem 4 (Minimum energy placement: independent case): 
For any pdf t supported on the unit square Qi, we have 

/ Tixf-'idx > 1, Vj/ > 2, (19) 
Jqi 

I T{xy-^dx < 1, Vj/e[0,2). (20) 
JQi 

Proof: We have the Holder inequality 

||/l/2||l<||/l||p||/2||„ Vp>l,q=^, (21) 

where for any positive function /, 

ll/llp:=(/ fixrdx)K 

When 1/ > 2, in ( 1211 1. substitute /i(x) with t(x)p, /2(a;) with 
T(a;)~p, and p with -^j^ > 1 which ensures that p > 1, to 
obtain (fT9] l. 

For v £ [0,2), in (l2T]i, substitute fi{x) with t{x)p, f2{x) 
with 1, p = 2^ > 1 to obtain (|20l). □ 

The above result implies that, in the context of i.i.d. node 
placements, it is asymptotically energy-optimal to place the 
nodes uniformly when the path-loss exponent v > 2, which 
is the case for wireless transmissions. The intuitive reason 
is as follows: without loss of generality, consider a clustered 
distribution in the unit square, where nodes are more likely 
to be placed near the origin. The MST over such a point set 
has many short edges, but a few very long edges, since a few 
nodes are placed near the boundary with finite probability. 
On the other hand, for uniform point sets, the edges of the 
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MST are more likely to be all of similar lengths. Since for 
energy consumption, we have power-weighted edge-lengths 
with path-loss exponent i' > 2, long edges are penalized 
harshly, leading to higher energy consumption for clustered 
placement when compared with uniform node placement. 

B. Energy scaling for optimal fusion: MRF case 

We now evaluate the scaling laws for energy consumption 
of the DFMRF policy for a general Markov random field 
dependency among the sensor measurements. The DFMRF ag- 
gregation policy involves the cliques of the dependency graph 
which arise from correlation between the sensor measure- 
ments. Recall that the total energy consumption of DFMRF 
in ( fT2b for random sensor locations V„ is given by 



where we use the property that the multihop shortest-path route 
from each node i to Proc(c) consumes no more energy than 
the direct one-hop transmission. 

In the DFMRF poUcy, recall that the processors are mem- 
bers of the respective cliques, i.e., Proc(c) C c, for each clique 
c in the dependency graph. Hence, in dZSl l. only the edges of 
the processors of all the cliques are included in the summation. 
This is upper bounded by the sum of all the power-weighted 
edges of the dependency graph S(V„). Hence, we have 

£(FG(V„))<7. I^r- (26) 

ees(v„) 

From ( l22b . for the total energy consumption of the DFMRF 
policy, we have the upper bound. 



£(DFMRF(V„)) 



^£-(^,Proc(c);:K,) 

cee(v„) icc 

+ £(MST(V„)), (22) 



where £^''(i, j ; Ng) denotes the energy consumption for the 
shortest path between i and j using the links in the network 
graph INfg(V„) (set of feasible links for direct transmission). 

We now additionally assume that the network graph 
INfg(V„) is a local u-energy spanner. In the literature |48|, a 
graph ?sf(,(V„) is called a u-energy spanner, for some constant 
u > called its energy stretch factor, when it satisfies 



max -- — , . . 

MGV„ £^-(z,j;Cg) 



(23) 



where Cg(V„) denotes the complete graph on V„. In other 
words, the energy consumption between any two nodes is 
no worse than u-times the best possible value, i.e., over the 
shortest path using links in the complete graph. Intuitively, the 
u-spanning property ensures that the network graph possesses 
sufficient set of communication links to ensure that the energy 
consumed in the forwarding stage is bounded. Examples of 
energy u-spanners include the Gabriel grapl{3 (with stretch 
factor u = 1 when the path-loss exponent v > 2), the Yao 
graph, and its variations |48|. In this paper, we only require 
a weaker version of the above property that asymptotically 
there is at most u-energy stretch between the neighbors in the 
dependency graph 



lim sup max 



£-(*,j;?^,(V„)) 



< u. 



(-j)GS(v„) £^^(z,j;Cg(V„)) 
From ([24b . we have 



£(FG(V„)) < w E E^"(*'Pi"°'^(^)'^s)' 

cGe(V„) iCc 

cee(v„) icc 



(24) 



(25) 



'^The longest edge in Gabriel graph is 0{\/log n), the same order as that 
of the MST 149]. Hence, the maximum power required at a node to ensure 
M-energy spanning property is of the same order as that needed for critical 
connectivity. 



£(DFMRF(V„)) < M Y |er + £(MST(V„)). (27) 

eeg(v,.) 

The above bound allows us to draw upon the general methods 
of asymptotic analysis for graph functionals presented in ifTTI . 
150|. 

From ( l27b . the DFMRF policy scales whenever the right- 
hand side of (|26] | scales. By Theorem [3] the energy consump- 
tion for aggregation along the MST scales. Hence, we only 
need to establish the scaling behavior of the first term in ( |26] |. 

We now prove scaling laws governing the energy con- 
sumption of DFMRF and we also establish its asymptotic 
approximation ratio with respect to the optimal fusion policy. 
This in turn also establishes the scaling behavior of the optimal 
policy. 

Theorem 5 (Scaling of DFMRF Policy): When the depen- 
dency graph S of the sensor measurements is either the k- 
nearest neighbor or the disc graph, the average energy of 
DFMRF policy satisfies 



limsup£(DFMRF(V„)) 

n— )-oo 

'<limsup(- J2 ?^|er + £(MST(V„)) 



eeS(v„ 



2 



-{x)dx 



J:(0,j)GS(Pa.(x)U{0}) 

+ A"^C(i^;MST) / T{xy'^dx. (28) 
jQi 

Proof: See Appendix iB] □ 
Hence, the above result establishes the scalability of the 
DFMRF policy. In the theorem below, we use this result to 
prove the scalability of the optimal fusion policy and establish 
asymptotic upper and lower bounds on its average energy. 

Theorem 6 (Scaling of Optimal Policy): When the depen- 
dency graph S is either the fc-nearest neighbor or the disc 
graph, the limit of the average energy consumption of the 
optimal policy tt* in ^ satisfies the upper bound 



limsup£(7r*(V„)) <' limsup £(DFMRF(V„)), (29) 
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where the right-hand side satisfies the upper bound in ( |28T l. 
Also, TT* satisfies the lower bound given by the MST 



The approximation ratio of DFMRF satisfies 



liminf £(DFMRF(V„)) liminf £(7r*(V„)) 



"> \im £(MST(V„)) = A-^^(j,;MST) I T{xy-^dx.(30) 



Proof: From (fTOb . the DFMRF and the optimal policy satisfy 
the lower bound given by the MST. □ 
Hence, the limiting average energy consumption for both the 
DFMRF policy and the optimal policy is strictly finite, and is 
bounded by ( |28] | and ( l30l l. These bounds also establish that the 
approximation ratio of the DFMRF policy is asymptotically 
bounded by a constant, as stated below. Define the constant 
P := p{u,\,T,i^), given by 



1 



/[ E \0,jr\T{x)dx 
3:(0J)eSCPAx(.)U{0}) 

A-fCli'iMST) / T{x)^-'idx 



p:=l + 



Lemma 1 (Approximation Ratio for DFMRF): The 
approximation ratio of DFMRF is given by 



(31) 



lim sup 



£(DFMRF(V„)) 

£(7r*(V„)) 



a.s. 

< 



£(DFMRF(V„)) 

lim sup — p/^joT^/^r ^^ — = P' 



(32) 



£(MST(V„)) 
where p is given by ( l3Tl i. 

Proof: Combine Theorem |5] and Theorem |6] □ 
We further simplify the above results for the fc-nearest 
neighbor dependency graph in the corollary below by ex- 
ploiting its scale invariance. The results are expected to hold 
for other scale-invariant Euclidean stabilizing graphs as well. 
The edges of a scale-invariant graph are invariant under a 
change of scale, or put differently, 9 is scale invariant if 
scalar multiplication by any positive constant a from 9(V„) 
to S(aV„) induces a graph isomorphism for all node sets V„. 

Along the lines of ([TT]), let ({i/; fc-NNG) be the constant 
arising in the asymptotic analysis of the fc-NNG edge lengths, 
that is 



((i^; fc-NNG) :=E 



E 



j:(0,j)efe-NNGCPiU{0}) 



(33) 



Corollary 1 (k-NNG Dependency Graph): We obtain a 
simplification of Theorem |5] and |6] for average energy 
consumption, namely 



£(DFMRF(V„)) aj. £(DFMRF(V„)) 
lim sup ^^^^ < Imi sup 



£(^*(v„)) 



(1 



£(MST(V„)) 
C(^; fc-NNG) ^ 
" C(-;MST) )■ ^^^^ 



Proof This follows from [TT, Thm 2.2]. □ 

Hence, the expressions for the energy scaling bounds and 
the approximation ratio are further simplified when the depen- 
dency graph is the fc-nearest neighbor graph. A special case 
of this scaling result for the 1 -nearest-neighbor dependency 
under uniform node placement was proven in |f38l Thm 2]. 

It is interesting to note that the approximation factor for the 
fc-NNG dependency graph in dSSl l is independent of the node 
placement pdf r and node density A. Hence, DFMRF has the 
same efficiency relative to the optimal policy under different 
node placements. The results of Theorem |4] on the optimality 
of the uniform node placement are also applicable here, but 
for the lower and upper bounds on energy consumption. We 
formally state it below. 

Theorem 7 (Minimum energy bounds for k-NNG): 
Uniform node placement (r = 1) minimizes the asymptotic 
lower and upper bounds on average energy consumption 
in dSOl l and ( |34] | for the optimal policy under the fc-NNG 
dependency graph over all i.i.d. node placement pdfs t. 
Proof: From Theorem l4l and (l34li. □ 

We also prove the optimality of uniform node-placement 
distribution under the disc-dependency graph, but over a 
limited set of node placement pdfs r. 

Theorem 8 (Minimum energy bounds for disc graph): 
Uniform node placement (t = 1) minimizes the asymptotic 
lower and upper bounds on the average energy consumption 
in (|30] | and ( l34l l for the optimal fusion policy under the 
disc dependency graph over all i.i.d. node-placement pdfs r 
satisfying the lower bound 



t{x) > Vx e Qi, 



(36) 



where A > 1 is the (fixed) node placement density. 
Proof: We use the fact that for the disc graph S with a 
fixed radius, more edges are added as we scale down the area. 
Hence, for Poisson processes with intensities Ai > A2 > 0, 



j:(oj)eSCPAiU{o}) 



> 



E lO'^r' 

j:(oj)eg(PA2u{o}) 





■A2" 




.Ai. 



lim sup £(7r*(Vn)) 



a.s. 

< 



a.s. 

< lim sup 

n—^oo ^ ^ 



E 

eGS(V„) 



limsup£(DFMRF(V„)) 

n— ^00 

M|er + £(MST(V„)); 



r ^ 



[u C(!^; fc-NNG) + C('^; MST)] / T(a;)i"^da;.(34) 



where the right-hand side is obtained by merely rescaling the 
edges present under the Poisson process at intensity A2. Since, 
new edges are added under the Poisson process at Ai, the 
above expression is an inequality, unlike the case of fc-NNG 
where the edge set is invariant under scaling. Substituting Ai 
with \t{x), and A2 by 1 under the condition that \t{x) > 1, 
Vx G Qi, we have 
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J2 \0,jr]T{x)dx 



E 



1 j:(0j)eS(7'A.(a.)U{0}) 



> A"2E 



J2 |0,jrl / r{x)^-Ux 



j:(0,j)eS(PlU{0}) 



> A"2E 



j:(0j)eS(7'lU{0}) 



> 2. 



Hence, uniform node placement is optimal in terms of the 
energy scaling bounds under the disc dependency graph if we 
restrict to pdfs r satisfying ( |36] |. 

We have so far established the finite scaling of the average 
energy when the dependency graph describing the correlations 
among the sensor observations is either the fc-NNG or the 
disc graph with finite radius. However, we cannot expect 
finite energy scaling under any general dependency graph. For 
instance, when the dependency graph is the complete graph, 
the log-likelihood ratio in (|7]i is a function of only one clique 
containing all the nodes. In this case, the optimal policy in (|9]l 
consists of a unique processor chosen optimally, to which all 
the other nodes forward their raw data along shortest paths, 
and the processor then forwards the value of the computed log- 
likelihood ratio to the fusion center Hence, for the complete 
dependency graph, the optimal fusion policy reduces to a 
version of the shortest-path (SP) routing, where the average 
energy consumption grows as y/n and does not scale with n. 

V. Numerical Illustrations 



As described in Section III-AI n nodes are placed in area 
J and one of them is randomly chosen as the fusion center. 
We conduct 500 independent simulation runs and average the 
results. We fix node density A = 1. We plot results for two 
cases of dependency graph, viz., the A:-nearest neighbor graph 
and the disc graph with a fixed radius S. 

In FigHl we plot the simulation results for the /c-nearest 
neighbor dependency graph and uniform node placement. 
Recall in Corollary [T] we established that the average energy 
consumption of the DFMRF policy in ( |34] | is finite and 
bounded for asymptotic networks under fc-NNG dependency. 
On the other hand, we predicted in Section lTAl that the average 
energy under no aggregation (SP policy) increases without 
bound with the network size. The results in Fig|4a| agree with 
our theory and we note that the convergence to asymptotic 
values is quick, and occurs in networks with as little as 
30 nodes. We also see that the energy for DFMRF policy 
increases with the number of neighbors fc in the dependency 
graph since the graph has more edges leading to computation 
of a more complex likelihood ratio by the DFMRF policy. 

We plot the approximation ratio of the DFMRF policy for 
fc-NNG in (|35] | against the number of nodes in Fig|4b] and 
against the path-loss exponent ly in FigHc] As established by 
Corollary [H the approximation ratio is a constant for large 
networks, and we find a quick convergence to this value in 
FigHSas we increase the network size. In Fig|4cl we also find 



that the approximation ratio is fairly insensitive with respect 
to the path-loss exponent i^. 

In Fig|5al we plot the average energy consumption of 
DFMRF in ( |28] | under uniform node placement and the 
disc dependency graph with radius 6. The average energy is 
bounded, as established by Theorem|5] As in the fc-NNG case, 
on increasing the network size, there is a quick convergence 
to the asymptotic values. Moreover, as expected, energy con- 
sumption increases with the radius 6 of the disc graph since 
there are more edges. Note that the energy consumption at 
5 = and S = 0.3 are nearly the same, since at S = 0.3, the 
disc graph is still very sparse, and hence, the energy consumed 
in the forwarding stage of the likelihood-ratio computation is 
small. 

We now study the effect of i.i.d. node-placement pdf r on 
the energy consumption of both DFMRF policy and shortest- 
path policy with no data aggregation. In Fig|5bl Fig|5c] and 
Fig|7] we consider a family of truncated-exponential pdfs Ta 
given by 



Ta{x)^Ux{l))Ux{2)), xeR\ 



(37) 



where, for some a 7^ 0, is given by the truncated exponential 




ifze[-i,i] 



o.w. 



(38) 



Note that as a— >-0, we obtain the uniform distribution in the 
limit (to = 1). A positive a corresponds to clustering of the 
points with respect to the origin and viceversa. In Fig|6] a 
sample realization is shown for the cases a = ±5 and a— >0. 

Intuitively, for shortest-path (SP) policy where there is no 
data aggregation, the influence of node placement on the 
energy consumption is fairly straightforward. If we cluster the 
nodes close to one another, the average energy consumption 
decreases. On the other hand, spreading the nodes out towards 
the boundary increases the average energy. Indeed, we observe 
this behavior in Fig|7] for the placement pdf Tq defined above 
in dSTT i and (l38T l. However, as established in the previous 
sections, optimal node placement for the DFMRF policy does 
not follow this simple intuition. 

In Theorem |4] we established that the uniform node place- 
ment (to = 1) minimizes the asymptotic average energy 
consumption of the optimal policy (which turns out to be the 
DFMRF policy), when the path-loss exponent > 2. For 1/ G 
[0,2], the uniform distribution has the worst-case value. This is 
verified in Fig|5b] where for £ [1,3], the uniform distribution 
initially has high energy consumption but decreases as we 
increase the path-loss exponent i^. We see that at threshold of 
around u = 2.4, the uniform distribution starts having lower 
energy than the non-uniform placements (clustered and spread- 
out), while according to Theorem H] the threshold should be 

= 2. Moreover, Theorem|4]also establishes that the clustered 
and spread-out distributions (a ± 5) have the same energy 
consumption since the expressions Jg^ Ta{xy^^ dx for a = 5 
and a = — 5 are equal for t^ given by ( [37] ) and ( [38] ), and this 
approximately holds in FigfSb] 
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Fig. 4. Average energy consumption for DFMRF policy and shortest-path routing for uniform node distribution and fc-NNG dependency over 500 runs. 
Node density A = 1. See Corollary [T] 
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(a) Disk graph, u = 2, uniform (r = 1). (b) Avg. energy vs. path loss, 5 = 0, n = 190. (c) Avg energy vs. disk radius, u = 4. 



Fig. 5. Average energy consumption for DFMRF policy over 500 runs for node-placement pdfs shown in Fig|6]under disc-dependency graph with radius 5. 
Node density A = 1. See Theorem [5] 



(a) Uniform a — > 0. (b) Clustered a = 5. (c) Spread-out a = —5. 
Fig. 6. Sample realization of n = 190 points on unit square. See )37t , )38t . 
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Fig. 7. Average energy for shortest-path routing policy over 500 runs for 
node-placement pdfs shown in Fig|6]and number of nodes n = 190. 

We now study the energy consumption of the DFMRF 
poHcy in Fig |5c] under the disc dependency graph and the node 
placements given in Fig|6] In Fig|5c] for path-loss exponent 
ly — i, we find that the uniform node placement (tq = 1) 



performs significantly better than the non-uniform placements 
for the entire range of the disc radius S. Intuitively, this is 
because at large path-loss exponent i^, communication over 
long edges consumes a lot of energy and long edges occur with 
higher probability in non-uniform placements (both clustered 
and spread-out) compared to the uniform placement. Hence, 
uniform node placement is significantly energy-efficient under 
high path-loss exponent of communication. 

VI. Conclusion 

We analyzed the scaling laws for energy consumption of 
data-fusion policies under the constraint of optimal statisti- 
cal inference at the fusion center. Forwarding all the raw 
data without fusion has an unbounded average energy as 
we increase the network size, and hence, is not a feasible 
strategy in energy-constrained networks. We established finite 
average energy scaling for a fusion policy known as the 
Data Fusion for Markov Random Fields (DFMRF) for a class 
of spatial correlation model. We analyzed the influence of 
the correlation structure given by the dependency graph, the 
node placement distribution and the transmission environment 
(path-loss exponent) on the energy consumption. 

There are many issues which are not handled in this paper 
Our fusion policy DFMRF needs centralized network infor- 
mation for constructed, and we plan to investigate distributed 
policies when only local information is available at the nodes. 
Our model currently only incorporates i.i.d. node placements 
and we expect our results to extend to the correlated node 
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placement according to a Gibbs point process through the 
resuhs in fST]. We have not considered here the scaUng 
behavior of the inference accuracy (error probabiHty) with 
network size, and this is a topic of study in ||39l . Il40l . We 
have not considered the time required for data fusion, and 
it is interesting to establish bounds in this case. Our current 
correlation model assumes a discrete Markov random field. A 
more natural but difficult approach is to consider Markov field 
over a continuous space L52J and then, sample it through node 
placements. 
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Appendix 
A. Functionals on random points sets 

In fTTl, fTTl, IH, Penrose and Yukich introduce the 
concept of stabilizing functionals to establish weak laws of 
large numbers for functionals on graphs with random vertex 
sets. As in this paper, the vertex sets may be marked (sensor 
measurements constituting one example of marks), but for 
simplicity of exposition we work with unmarked vertices. We 
briefly describe the general weak law of large numbers after 
introducing the necessary definitions. 

Graph functionals on a vertex set V are often represented 
as sums of spatially dependent terms 

where V C is locally finite (contains only finitely many 
points in any bounded region), and the measurable function 
^, defined on all pairs (x,V), with x G V, represents the 
interaction of x with other points in V. We see that the 
functionals corresponding to energy consumption can be cast 
in this framework. 

When V is random, the range of spatial dependence of ^ 
at node x G V is random, and the purpose of stabilization 
is to quantify this range in a way useful for asymptotic 
analysis. There are several similar notions of stabilization, but 
the essence is captured by the notion of stabilization of ^ 
with respect to homogeneous Poisson points on R^, defined 
as follows. Recall that Va is a homogeneous Poisson point 
process with intensity a > 0. 

We say that ^ is translation invariant if (_{x, V) — £^{x + 
z, V + z) for all z S K^. Let denote the origin of and 
let Br{x) denote the Euclidean ball centered at x with radius 
r. A translation-invariant ^ is homogeneously stabilizing if for 
all intensities a > there exists almost surely a finite random 
variable R :— R{a) such that 

f (0, (Va n BRio)) uA)^ C(o, Va n 5,^(0)) 

for all locally finite C R^ \ Br{0). Thus ^ stabilizes if the 
value of ^ at is unaffected by changes in point configurations 
outside Bfi{0). 

^ satisfies the moment condition of order p > if 




Normalized sum of edges Expectation of edges 

of origin of Poisson process 

ces(v„) ceE(o.sCPxu{o})) 

Fig. 8. LLN for sum graph edges on uniform point sets (r = 1). 



supE[^(n^Xi,n*{Xj^Li)^] < oo. (39) 

We use the following weak laws of large numbers through- 
out. Recall that Xi are i.i.d. with density r. 

Theorem 9 (WLLN SB, 0Ol/ ).- Put g = 1 or g = 2. Let ^ 
be a homogeneously stabilizing translation-invariant functional 
satisfying the moment condition ( [39] l for some p > q. Then 

i=i * * 

= / m{^,'Pxr[x))]T{x)dx in i«. (40) 
■JQi 

We interpret the right-hand side of the above equation 
as a weighted average of the values of ^ on homogeneous 
Poisson point processes Vxt{x)- When ^ satisfies scaling such 
as E [^(0, Va)] ^ a""E [^(0, Vi)], then the Hmit on the right- 
hand side of (l40l l simplifies to 

A-"E[e(0,7'i)] / (r(x))i^"dx in L", (41) 

a limit appearing regularly in problems in Euclidean combina- 
torial optimization. For uniform node placement {t{x) = 1), 
the expression in (l40t reduces to E[^(0,7'a)], and the LLN 
result for this instance is pictorially depicted in Fig|8] 

For example, if ^(a:,V) is one half the sum of the v- 
power weighted edges incident to x in the MST (or any scale- 
invariant stabilizing graph) on V, i.e., 

C(x,V):=i J2 Ier, 

ee-E(a;,MST(V)) 

then substituting a with ^ in (|4TI ). 

2—1 

= \-^E[aO,Vi)] f {T{x)y-^dx 
JQi 

= A-*C('^;MST) [ {T{x))^''idx, (42) 
where MST) is defined in (HT]). 
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B. Proof of Theorem |5] 

The energy consumption of DFMRF satisfies the inequality 
in ( |28] |. For the MST we have the result in Theorem [3] We 
now use stabilizing functionals to show that 



- T. 

n ^ — ' 



eGg(V„) 

converges in to a constant. For all locally finite vertex sets 
A" C supporting some dependency graph ^{X) and for all 
X & X, define the functional 77(2:, X) by 



'n{x,X): 



y:(a:,y)eS(Ar) 



(43) 



Notice that Y.xex vi^, X)^2 EeeS(X) \^\"- 

From ifTTl Thm 2.4], the sum of power-weighted edges of 
the fc-nearest neighbors graph is a stabilizing functional and 
satisfies the bounded-moments condition ( [39] ). Hence, the limit 
in ( |40| ) holds when the dependency graph is the fc-NNG. 

Finally, the sum of power-weighted edges of the continuum 
percolation graph is a stabilizing functional which satisfies the 
bounded-moments condition ( [39l ), thus implying that the limit 
in (gOl) holds. 

Indeed, 77 stabilizes with respect to Va, a e (0, 00), since 
points distant from x by more than the deterministic disc 
radius do not modify the value of ri{x,'Pa)- Moreover, 
satisfies the bounded moments condition ( [39] l since each \x,y\ 
is bounded by the deterministic disc radius and the number of 
nodes in 712 {X^jf^]^ which are joined to n^Xi is a random 
variable with moments of all orders. □ 
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